# One-Shot Learning
Kotoba Speech V0.1
Apache-2.0
Kotoba-Speech v0.1 is a Japanese speech generation model based on a 1.2B parameter Transformer, supporting text-to-speech and one-shot voice cloning.
Speech Synthesis
Transformers Japanese

K
kotoba-tech
23
16
Deplot
Apache-2.0
DePlot is a visual-language reasoning model capable of converting chart images into linearized tables, enabling few-shot reasoning when combined with large language models
Image-to-Text
Transformers Supports Multiple Languages

D
google
13.72k
298
Featured Recommended AI Models